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DOTTED MICRO- ARRAY DATA EXTRACTION METHOD 

CROSS-REFERENCE TO RELATED APPLICATIONS 

The application claims the benefit of provisional U.S. Application Serial 
5 No. 60/261,305, filed January 12, 2001, and titled "GLEAMS: A Novel Approach to High 

Throughput Genetic Micro-Array Image Capture and Analysis," which is incorporated 
herein by reference in its entirety. 

BACKGROUND OF THE INVENTION 

10 Many members of the scientific community believe that genetic 

information lies at the root of many diseases, and that genes are likely to be responsible for 
these diseases, whether the genes are the causes of disease, or otherwise promote disease 
through the encoding of proteins. The Human Genome Project has provided significant 
information on the recipes for proteins. The Human Genome Project has enhanced the 

15 ability to diagnose, if not eventually cure, many diseases. 

It is also well known that knowledge of the raw sequence of the more than 
three million base pairs m the human genome is insufficient in and of itself to permit the 
diagnosis or cure of any disease. Identifying genes requires the mapping of the genome 
into chromosomes and the identification of exons and introns. Identifying genes, however, 

20 is only the beginning; an analysis of the role, purpose and position of a gene within a 

pathway should facihtate an understanding of the causal relationship between genes, the 
illness, and the stage of the development of the illness. 

As part of the development of the human genome projects, scientists have 
recognized the concept of differential gene expression. Gene expression is the process by 

25 which the coded expression of a gene is converted into the structures that are present and 

operating in the cell. The first step in gene expression is transcription, which is the 
process in which mRNA is formed by RNA polymerase to be complementary to a DNA 
sequence. Following transcription, mRNA serves as a template for protein synthesis 
through a process called translation. The expression level of a gene, a measure of its level 

30 of activity in a cell line, correlates with the density of its corresponding mRNA. The 

expression level of any particular gene can vary firom individual to individual and vary 
with the progression of a disease or a treatment in a single individual. The knowledge of 
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an expression level of a gene therefore assists a researcher in determining whether that 
gene responds to a trait, stimulus, or disease. 

Genes can also be investigated at the DNA level, which, as described 
above, creates a complementary template for the transcribed gene. Various types of 
5 genetic alterations can be seen at the DNA level. The complementary DNA template may 

contain an alteration in the complementary template region, thus producing altered 
mRNA. In addition, an alteration may occur outside the coding region, creating an error 
during transcription or translation. Finally, the human genome is assumed to have many 
variable or polymorphic sites known as single nucleotide polymorphisms (SNPs). 
10 Scientists are currently attempting to correlate disease with a set of SNPs in genomic 

DNA. 

The ability to construct micro-arrays from cDNA, hybridize them with 
control and treatment samples, and analyze them, opens the door to comparative research 
and analysis. Gene micro-array technology enables researchers, for example, to measure 

15 simultaneously the expression levels of a large number of genes in a normal tissue sample 

relative to a cancerous tissue sample. By inspecting the gene expression on a micro-array 
for tissue samples of, say, melanomatic and healthy skins, it is possible to identify a group 
of genes responsible or at least related to the disease. Thus, gene micro-array technology 
enables tiie simultaneous measurement of gene expression levels in a normal tissue sample 

20 relative to a test condition, such as, for example, cancerous tissue. 

One of the goals of a micro-array experiment is to simultaneously examine 
the expression of all genes of a specific organism in a cell type in a specific growth or 
stress condition. Micro-array technology requires the positioning of minute amounts of, 
for example, DNA representing specific genes, onto a small glass slide in an ordered 

25 fashion. This produces the DNA micro-array, which can contain, depending on the 
availability of clones from an organism, a specific subset of genes, or all genes. Thus, 
with a single experiment, gene array technology allows researchers to measure 
simultaneously a large number of genes in a tissue sample under normal conditions and 
under test conditions. Micro-arrays can be made by placing hundreds or even thousands 

30 of genes on a glass or nylon subsfrate. The cDNA material for each gene is deposited as a 

small dot onto the array. Because the identity of the cDNA at a particular element in the 
array is known, the identification of expressed genes simply follows a determination of 
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which elements in the array have formed a binding complex with the sample DNA placed 
on the array. 

Li the course of a micro-array experiment, mRNA from two tissue samples 
— one control sample and the other a test sample — are each labeled with a fluorophore. 
5 If a single array is used, each sample receives a spectrally distinct fluorophore. For 

example, the control sample may be labeled with a fluorophore that emits in the green 
region, and the test sample may be labeled with a fluorophore that emits in the red region. 
Likewise, if only a single fluorophore is used for labeling, then multiple arrays — one for 
the test sample and one for the control sample — are required. The labeled samples are 

10 hybridized with the micro-array. Following hybridization, the arrays are washed at a 

chosen stringency to maximize the amount of perfectly complementary material bound to 
the micro-array, and, at the same time, minimize the amount of mismatched material 
boimd to the array. Following washing, the micro-arrays are scanned with lasers that 
excite one or more of the fluorophores. The resulting fluorescence is captured, and the 

15 two fluorescent images are combined to produce a single color image or to generate two 

gray-scale images, one for each tissue sample. The two images can also be considered as 
two channels of a color image, which is known as a two-color fluorescence image. The 
experiment described above can also be done with a single tissue sample. In either case, 
the intensity of a captured fluorescence at each element in the micro-array correlates to the 

20 expression level of the corresponding gene in the tissue used to produce the resulting 

image. 

In recent years, micro-array technology has become a very important tool 
in the analysis of gene expression and SNP studies. The development and use of 
micro-array technology has exploded in the past few years. Improvements have occurred 

25 in the areas of hardware, biological assays, and data quality. The rate of growth of 
micro-array use has increased, creating an ever increasing mound of data for analysis. 
Associating each micro-array element with a known grid location is currently an 
inefficient task. For example, this process typically requires manually locating each dot in 
the micro-array and then determining the coordinate location in the x and y direction for 

30 every element in the micro-array. The process of selecting, identifying, and providing a 
coordinate location for every element in the micro-array becomes unmanageable when 
there are hundreds or thousands of elements in the micro-array and when there are dozens 
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or hundreds of micro-arrays. The greatest challenge is to analyze large amounts of data in 
an automated, high-throughput fashion. Any such analysis system will require a 
computer, digital signal processing (DSP), and analytical tools. Few integrated systems, 
^d even less software, is available to solve this problem. 
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SUMMARY OF THE INVENTION 

The invention disclosed herein is an automated method for locating the 
sub-arrays of a micro-arrays and for locating within each sub-array the dots of each 
sub-array. For each micro-array image, the orientation and the lattice constant of each 
5 sub-array is calculated. After the orientation of the sub-arrays are determined, the 

sub-arrays may be rotated so that each sub-array is aligned with the rectihnear direction of 
the rows and columns of the micro-array. Regions of the micro-array image are compared 
to sub-array profiles to identify the sub-array regions of the micro-array. Once the 
location of each sub-array is determined, an estimate of the location of each dot is 

10 determined on the basis of the lattice constant. 

A search of each sub-array is conducted to locate the dots of each sub- 
array. As part of this search, objects are located that are approximately round in size and 
are a collection of several pixels. Once the dots of the sub-arrays have been located, a 
two-dimensional vector field is constructed. The vector fields represents a 

15 two-dimensional displacement value of actual dot placement, as shown by the search for 

dot-like collections of pixels, versus the estimated dot placement. From these 
displacement values, an estimate of the position of each dot can be calculated. A 
constraining shape mask is constructed, and a segmentation method is applied to detect the 
location of each dot within the collection of pixels. The constraining shape mask is 

20 applied to each detected dot, limiting the boundary of and precisely identifying the 

location of each dot. 

A technical advantage of the disclosed invention is an automated process 
that precisely identifies the location and boundary of each dot of each sub-array of a 
micro-array. Because of the method disclosed herein, a previously labor-intensive task of 

25 identifying each sub-array and dot has been simplified through • automation such that 

hundreds of sub-array images may be processed through the method disclosed herein. 
Another technical advantage of the present invention is a method in which the process of 
locating each sub-array and dot is conducted with some degree of mathematical precision. 
Rather than employing a manual and time-consuming process of the individual location of 

30 sub-arrays and dots within, the method disclosed herein accomplishes this process 
according to a precise, automated method. 
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BRIEF DESCRPTION OF THE DRAWINGS 

A more complete understanding of the present embodiments and 
advantages thereof may be acquired by referring to the following description taken in 
conjunction with the accompanying drawings, in which like reference numbers indicate 
5 like features, and wherein: 

Figure 1 is a graphical representation of a micro-array; 

Figure 2 is a flow diagram of method steps for orienting a micro-array 
image and locating the sub-arrays of a micro-array; 

Figure 3 is a flow diagram of method steps for estimating the position of 
1 0 each dot of each sub-array; and 

Figure 4 is a flow diagram of method steps for estimating the detecting and 
delineating the position of each dot of each sub-array. 
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DETAILED DESCRIPTIQN OF THE INVENTION 

Shown in Figure 2 is a graphical representation of a micro-array 100. In a 
typical micro-array, the dots 102 are organized into one or more sub-arrays, which are 
shown in Figure 2. The sub-arrays 104 are arranged in an arrangement of four columns by 
5 two rows. In the example of Figure 1, the dots of each sub-array are arranged in a 

rectangular grid of four columns and five rows. Often, sub-arrays are themselves arranged 
in a rectilinear fashion to form the micro-array. The most easily obtained and often the 
only reliable information about the image of a micro-^ay, before it is visually examined, 
is the spotting geometry of the micro-array. The spotting geometry of the micro-array is 

10 the number of rows and columns of the dots in each sub-array and the number of rows and 

columns of the sub-arrays. The method described herein uses as inputs the spotting 
geometry, together with the images tiiemselves. The method described herein does not 
assume the size of the dot of a micro-array and does not assume that the distance between 
neighboring dots is known. Similarly, the method described herein does not assume that 

15 every dot in each sub-array is present or that the grids formed by the dots are highly 

regular. The method described herein is able to process a large number of micro-array 
images because the method involves a top-down approach of discerning the sub-arrays 
before detecting and analyzing the dots of the sub-arrays. 

The method described herein involves orienting the micro-array image and 

20 locating the sub-arrays of the micro-array, an optional step of refining the dot placements 

within the micro-arrays, and dehneating the extent of each dot. With respect to the step of 
orienting the micro-array image and locating the sub-arrays of the image, an automated 
method detects the orientation of the array grid and locates the sub-arrays within the 
image. With respect to the optional step of refining the dot placements within the 

25 micro-arrays, the grid points within each sub-array are shghtly adjusted to account for any 
deviation fi-om regularity in the sub-arrays. With respect to the step of delineating the 
extent of each dot, a threshold-based object detection method followed by an optional 
shape manipulation step is employed to delineate the boundary of each dot, thereby 
separating signal pixels fi-om background noise or other signals. 



30 
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I. Image Orientation and Location of Sub-Arrays 

A flow diagram of the method steps for spatially orienting the micro-array 
image and locating the sub-arrays of the micro-array is shown in Figure 2. With respect to 
the step of orienting the micro-array image and locating the sub-arrays of the micro-array, 
5 the dimensions of the sub-arrays are estimated. Following an estimation of the dimensions 

of the sub-arrays, a template sub-array, representing a sub-array having a set of identical 
dots perfectly positioned in the sub-array is used to detect ttie actual location of the 
sub-arrays. At step 202, the lattice constant and the orientation of the sub-arrays are 
determined. If the number of rows and columns of a sub-array are known, the size of the 
C= 10 sub-array can be determined by the lattice constant, which is the average distance between 

centers of neighboring dots. If it is the case that the location of none of the dots of a sub- 
array are known, the lattice constant can be estimated by examining the periodicity of 
r- structures within the image. 

With respect to examining the periodicity of structures to determine a 
"~ 15 lattice constant, an approach for determining the lattice constant is to identify peaks in a 

2-D periodogram of the image. Within the 2-D periodogram, the lattice constant can be 
-'^ determined by measuring the distance between the two strongest peaks of the 

periodogram. Using this approach, the orientation of, i.e. the angle formed by, a vector 
from the coordinate origin to the closest peak provides the angle of rotation of the grid 
20 array. Any noise introduced by false peaks can be at least partially overcome by using an 

averaged periodogram, which can be calculated by dividing the image into overlapping 
blocks and averaging the periodograms computed from each block. 

The lattice constant of the sub-arrays can alternatively be determined from 
the 2-D auto-correlation function of the micro-array. Starting from the origin of the 2-D 
25 auto-correlation function, the first peak that hes within a few degrees of the +x direction is 

identified. The distance from the origin to this peak is equal to the lattice constant in the x 
direction, and the orientation of the peak is the orientation of the sub-array. A similar 
measurement can be used to determine the lattice constant in the y direction. Because of 
the averaging nature of the auto-correlation function, the noise level of this analysis is low, 
30 diminishing the possibility of false peaks. Although the peak produced by the auto- 
correlation fimction may be broad, the step of fitting a curving surface around the peaks 
and calculating the apex of the surface can be used to estimate accurately the location of 
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each peak. In the case of images with very high levels of noise and irregularity in the dot 
shape and placements, the peaks can be so broad that some of the peaks merge into the 
primary peak at the origin and become undetectable. In this case, the application of a 
smoothing function with an automatically calculated threshold may sufficiently enhance 
5 the image that peaks produced by the auto-correlation function become distinct. 

Once the lattice constant and the orientation of each sub-array of the 
micro-arrays are known, the image can be rotated at step 206 so that the sides of the image 
align with the direction of the rows and columns of the micro-array. From this point, it is 
assumed that any misalignment between the x-axis and the rows of the sub-arrays, and the 

10 >'-axis and the columns of the sub-array are negligible. 

A sub-array image typically covers an area on the order of a few himdred 
thousand pixels. As such, a few hundred thousand pixels in the vicinity of each location in 
the image must be examined to determine if the pixels represent a structure resembling a 
sub-array. A template sub-array is created at step 208 that has the number of expected 

15 rows and columns and containing identical round dots spaced according to the estimated 

lattice constant. At step 210, the template sub-array is used as a guide to identify regions 
of the image that resemble sub-arrays. The identical round dots may each have a Gaussian 
profile. If a region is found in the image that resembles the template, it is likely that the 
region is a sub-array. The degree of resemblance between the region of the image and the 

20 template can be measured by the cross-correlation between the template and the region of 

the image. Following the application of the cross-correlation function between the region 
of the image and the template, and assuming that the origin of the template is at the 
geometrical center of the template, each local maximum point of the cross-correlated 
image represents a possible location of the geometrical center of a sub-array. The number 

25 of such possible locations can be reduced if each local maximum point is discarded when 

it is not the absolute maximum point in an area of a few grid cells around it. 

If fewer possible sub-array regions are found, as compared to the number of 
expected sub-arrays, it is possible that the image has a strong background that drifts in 
amplitude firom one side of the image to the other. To account for this, an automatically 

30 determined locally varying threshold function can be applied to the image before the 
cross-correlation function is computed. As a second alternative, a two-dimensional 
low-cut filter can be applied to the image to remove large-scale trends from the image. By 
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automatically trying different combinations of these or other enhancements, an algorithm 
can be found that will locate the sub-arrays in all but the most problematic images, which 
must be visually examined by the user. 

When more possible images are detected than there are sub-arrays, 
geometrical restrictions can be applied to exclude the false possible images. Possible 
geometric conditions include the rule of including sub-arrays that do not overlap and that 
are organized in approximately rectangular grids, the allowable regularity of which can be 
progressively restrained until only one set of conforming selections is left. As a second 
possible condition, the geometrical constraints can be relaxed, revealing multiple sets of 
conforming locations. Of these multiple sets of conforming locations, the set with the 
largest amplitude sum is selected. If these two conditions — restraining and relaxing the 
geometrical constraints — do not produce the same results, the user is alerted while the 
program proceeds with the result from the restraining methods. 

After the location of each of the sub-arrays is determined, the expected 
location of each individual dot can be estimated on the basis of the estimated values of the 
lattice constant. The procedure for locating the sub-arrays of the image can be modified 
sUghtly to take advantage of added location information for those micro-array images that 
have two color channels. In the case of a micro-array image that includes two color 
channels, tiie process of locating the sub-arrays is first applied to one color channel, 
treating this channel as though it were a single chaimel gray-scale image. If this step fails 
to locate the sub-arrays, the procedure is repeated with the other color charmel as the input. 
If this step continues to yield no definite result, the two channels are summed together to 
form a single gray scale image to be used as the input to the steps of locating the 
sub-arrays of the image. 

II. Refining Dot Placements 

In some micro-arrays, the positions of the dots of the micro-array differ 
visibly from the expected location of the dots if the dots were to fall in place on a regular 
rectangular grid. For the purpose of forming the constraining shape mask, which is 
described later herein, the deviations and the actual centers of the dots are detected. 
Because of the presence of deviations in the dots and the possible presence of unknown 
artifacts in the image, it may not be possible to detect every dot in the micro-array. The 
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method described herein searches for and accepts only those objects that look Hke dots. A 
flow diagram of the method is shown in Figure 3. 

Starting from the center of a grid point within a sub-array, an outward 
concentric search is performed for any objected formed by contiguous pixels that have 
5 intensities that are significantly greater than the background level of the image, as 
indicated by step 302 of Figure 3. A search is performed for objects that are 
approximately round and of a size that is l^ger than a few pixels but smaller than the area 
of the largest circle that can be fit in a grid cell. In many cases, fewer qualified objects can 
be found for each sub-array than there are dots in the sub-array. A twenty percent (20%) 

10 trimmed mean of the area of all such objects is calculated, and any object whose area 

differs by more than a factor of two fi-om this value is rejected. In performing this 
analysis, the requirement that neighboring dots should not overlap or touch is not imposed. 

At step 304, for objects that satisfy the conditions of the outward concentric 
search, the displacements of the expected positions to their actual positions are calculated, 

15 forming for each sub-array a two-dimensional vector field. The vector field will include 

gaps in the data for those gaps where the displacement is undetermined. A 
two-dimensional moving trimmed filter is applied to each component of the field as part of 
step 306, and gaps in data of the field are filled with data achieved by interpolation. 
Applying these displacements to the regular grid positions yields an estimate for the 

20 position of every dot, as indicated by step 308 of Figure 3. For images that are 

two-channel images, a displacement field is calculated on the basis of and from each 
channel. The channel that yields the more acceptable objects is used as the final result. 

III. Target Detection and Delineation 

25 Some dots in the image may suffer from "bleed-over," which is 

characterized by a dot that lacks a fully enclosed and clearly defined boundary. A 
constraining shape mask can be applied to impose a restraint on the maximum allowable 
size for a dot. A constraining mask is formed for each sub-array so that depositing pins 
with different physical characteristics can deposit one or more of the sub-arrays of the 

30 micro-array. The step of constructing a constraining shape mask is shown as step 402 in 
Figure 4, which depicts the steps of delineating and detecting each dot of each sub-array of 
the micro-array. The shape mask is constructed from a stacked or summed image of the 
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strongest of the dots in the sub-array. To determine the relative strength of a dot, a 
measurement can be made of the average intensities of the pixels within the grid cell. For 
this calculation, the grid cell of a dot is the rectangular area centered at the dot and with 
sides equal to the lattice constants. For the purpose of this analysis, the strongest dots of 
5 the sub-array can be considered to be the strongest 10% of the dots of the sub-array. As an 

alternative, the strongest dots of the sub-array can be considered to be all those dots above 
a certain threshold, with the threshold being calculated from an estimation of the 
background mean and variance. The dots that are determined to be strongest dots of the 
array are stacked together and summed. 

10 Because the centers of the dots have not been determined with certainty, the 

dots may be misaligned when stacked, causing the summed image to be slightly blurry. 
The process of summing the images will also reduce the relative strength of the 
background noise variance. From the summed image, a threshold can be computed by, for 
example, the method disclosed by Otsu's method, which is described in Otsu, N., "A 

15 Threshold Selection Method from Gray-Level Histograms", IEEE Transactions on 

Systems, Man, and Cybernetics, 9(1): 62-66, which is incorporated by reference herein. 
The shape mask is constructed by dilating by one pixel the object formed by the pixels 
above the threshold in the stacked image. The blurriness of the simmied images and the 
dilation operation cause the shape mask to be larger than most dots in the array. If a dot 

20 that is detected in a later step does not fit within the shape mask, it is likely that the dot 
suffers from bleed-over, and the extent of the dot should be constrained to stay within the 
mask to prevent inclusion of background pixels into signal strength calculations. For 
two-channel images, strong dots from both channels can be used to produce the stacked 
image. 

25 As part of step 404, each dot is detected according to a segmentation 

method. A threshold based segmentation method may be used to detect individual dots. 
For each dot to be detected, Otsu's method is used to compute a threshold from the 
histogram of the pixels within the dot's grid cell after the application of a median 
smoothing fimction to the image. The computed threshold is constrained to fall within a 

30 range determined from an estimate of the mean of the local background and variance. 

This background estimation is made on the basis of pixels that are outside the largest 
circles that can fit within the grid cells. Background pixels from a 5-by-5 grid cell area 
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around each target dot are included for the analysis of that target dot. As an example, with 
a lattice constant of 25 pixels, there are about three thousand pixels included in the 
background estimation for each target dot. It has been found that the range of (\i+3g, 
(x+lOa) to be a reasonable choice as a constraint on the threshold selected by Otsu's 
5 method, where (x and a are the mean and variance, respectively, of the background. 

Segementation methods that are based solely on the threshold analysis rely 
on the intensity of a pixel when determining whether the pixel represents a signal or 
background. These methods can be augmented by morphological operations that merge 
unconnected regions and smooth the contours of detected objects based on positional 

10 information. The hysteresis threshold method is a segmentation method that includes both 

an analysis of pixel intensity said morphological operations. The hysteresis threshold 
method uses a lower threshold for pixels that are connected to pixels above a higher 
threshold. The application of a constrained Otsu threshold followed by some minimal 
amount of morphological operation can achieve better results for most micro-array 

15 images, as compared with the application of a segmentation method that involves only 

intensity analysis. 

The detected object is compared against the constraining shape mask, as 
indicated by step 406 of Figure 4. When the object does not fit entirely within the mask, a 
best fit may be found by shghtly sliding the mask around the area of the object. When a 

20 best fit is found, pixels that fall outside the boimdary of the constraining shape mask are 

dropped. For two-channel images, the union of the two objects detected in the two 
channels are used as the final result, thereby reflecting the fact that the distributions of the 
two types of dyed genetic material that generate the image in each channel could differ. 
Because a dot is defined to be the area occupied by material deposited on the micro-array, 

25 the presence of one type of dyed material in a certain region is sufficient indication that 
the region is a part of a dot. For this reason, calculation of signal strength for either 
channel is carried out over all pixels in the union. 

In addition to the noise generated by the random variation of the 
background image, confetti-like noise caused, for example, by large particles of 

30 contamination may also be present on the image. Such artifacts, typically in the form of a 

small group of high intensity pixels, are excluded firom any signal calculations. On the 
basis of the assiraiption that all normal pixels belonging to a single dot are relatively 
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homogeneous in terms of intensity, statistical outliers can be separated from normal signal 
pixels. The separation of the outlying artifacts can be accomplished by a second 
segmentation process that segments out pixels that are included as part of the signal during 
the first segmentation process. For two-channel images, any pixel detected as noise in 
5 either channel must be excluded from the signal calculations in both channels. 

After the boundary of each dot is determined, signal and background 
statistics can be determined for each dot. Because the signal pixel intensities do not 
follow any simple random process model, a trimmed mean is the most appropriate 
measxu-e of an estimate average intensity. The background pixel intensities can be 

10 modeled by a gamma distribution, although artifacts and other statistical outliers must first 

be excluded before the fitting of the gamma distribution. Background pixels from 
neighboring grid cells are typically included in the computation for each dot. Quality 
metrics that measure the signal against the background image, the shape and strength of a 
dot against the shape and strength of other dots in the array, and the local background 

15 against the background in other regions may also be applied to the dot computation. 

The method disclosed herein is an automated technique for precise 
coordinate location in the x axidy direction of each element in a micro-array. The method 
may be automated such that the method can be used as a means for the efficient processing 
of hundreds of images having within each thousands of individual dots. 

20 Although the present disclosure has been described in detail, it should be 

understood that various changes, substitutions, and alterations can be made hereto without 
departing from the spirit and the scope of the invention as defined by the appended claims. 



